Propbank-Br: a Brazilian Treebank annotated with semantic role labels
Abstract
This paper reports the annotation of a Brazilian Portuguese Treebank with semantic role labels following the Propbank guidelines. Working with a different language and a different parser output affects the task and requires decisions on how to annotate the corpus. A new annotation guide, called Propbank-Br, was therefore written to deal with language-specific phenomena and parser problems. In this phase of the project, the corpus was annotated by a single linguist. The annotation task reported here is part of a larger project for Brazilian Portuguese, which aims to build frame files for Brazilian Portuguese verbs and to carry out a broader, distributed annotation of semantic role labels, allowing inter-annotator agreement to be measured. The corpus, available on the web, is already being used to build a semantic tagger for Portuguese.
Similar resources
Propbank-Br: a Brazilian Portuguese corpus annotated with semantic role labels
Semantic Role Labeling is a task in Natural Language Processing often carried out using annotated corpora. So far, there is no available corpus of Portuguese annotated with semantic role labels. This paper reports the annotation of a Brazilian Portuguese corpus following Propbank guidelines. This is the first step of a larger annotation effort and aims to pave the way for a distributed annotat...
Labeling Chinese Predicates with Semantic Roles
In this article we report work on Chinese semantic role labeling, taking advantage of two recently completed corpora, the Chinese PropBank, a semantically annotated corpus of Chinese verbs, and the Chinese Nombank, a companion corpus that annotates the predicate–argument structure of nominalized predicates. Because the semantic role labels are assigned to the constituents in a parse tree, we fi...
Automatic Generation of a Lexical Resource to support Semantic Role Labeling in Portuguese
This paper reports an approach to automatically generate a lexical resource to support incremental semantic role labeling annotation in Portuguese. The data come from the corpus Propbank-Br (Propbank of Brazilian Portuguese) and from the lexical resource of English Propbank, as both share the same structure. In order to enable the strategy, we added extra annotation to Propbank-Br. This approac...
Semantic Roles for Nominal Predicates: Building a Lexical Resource
The linguistic annotation of noun-verb complex predicates (also termed as light verb constructions) is challenging as these predicates are highly productive in Hindi. For semantic role labelling, each argument of the noun-verb complex predicate must be given a role label. For complex predicates, frame files need to be created specifying the role labels for each noun-verb complex predicate. The ...
Issues In Synchronizing The English Treebank And PropBank
The PropBank primarily adds semantic role labels to the syntactic constituents in the parsed trees of the Treebank. The goal is for automatic semantic role labeling to be able to use the domain of locality of a predicate in order to find its arguments. In principle, this is exactly what is wanted, but in practice the PropBank annotators often make choices that do not actually conform to the Tre...